Site-Specific Recombination in Eukaryotes
and Constructs Useful Therefor

FIELD OF THE INVENTION 

The present invention relates to methods for
manipulating chromosomal sequences in cells by
site-specific recombination promoted by recombinases. In a
particular aspect, the present invention relates to methods
for producing embryonic stem cells bearing nucleic acid
sequences that have been rearranged by a site-specific
recombinase expressed from a construct controlled by a
tissue-specific promoter (e.g., a germline-specific
promoter). In another aspect, the present invention
relates to methods for producing embryonic stem cells
bearing nucleic acid sequences that have been rearranged by
a site-specific recombinase expressed from a construct
controlled by a conditional promoter.

BACKGROUND OF THE INVENTION

The analysis of gene function has increasingly come to
require the production of subtle, tissue-specific, and
conditional mutations in animals and plants. Although
there are a number of methods for engineering subtle
mutations in embryonic stem (ES) cells (Hasty et al. (1991)
Nature 350:243-246; Askew et al. (1993) Mol Cell Biol
13:4115-4124), the use of site-specific recombinases to
remove the selectable marker that permits isolation of
homologously recombined ES cell clones has become
increasingly prevalent (Kitamoto et al. (1996) Biochem
Biophys Res Commun 222:742-747; Fiering et al. (1993) Proc
Natl Acad Sci USA 90:8469-8473; Schwenk et al. (1995)
Nucleic Acids Res 23:5080-5081; Gu et al. (1993) Cell
73:1155-1164; Sailer et al. (1996) Taniguchi Symposia on
Brain Sciences, eds. Nakanishi et al. (Japan Scientific
Press), pp. 89-98).

Site-specific recombinases represent the best method
for creating tissue-specific and conditional mutations in
animals and plants, being employed first to remove the
selectable marker to create a functionally wild-type
allele, and then to inactivate the allele mosaically in
animals and plants by removing some essential component in
a tissue-specific or conditional manner (Gu et al. (1994)
Science 265:103-106; Kuhn et al. (1995) Science
269:1427-1429). Current protocols for using excisive
site-specific recombination to remove selectable markers
include transiently transfecting ES cell clones with a
recombinase expression vector (Gu et al. (1993) Cell
73:1155-1164), microinjecting fertilized oocytes containing
the recombinant allele with a recombinase expression vector
(Kitamoto et al. (1996) Biochem Biophys Res Commun
222:742-747; Araki et al. (1995) Proc Natl Acad Sci USA
92:160-164), or breeding animals and plants containing the
recombinant allele to animals and plants, respectively,
containing a recombinase transgene (Schwenk et al. (1995)
Nucleic Acids Res 23:5080-5081; Lewandoski et al. (1997)
Curr Biol 7:148-151). Each of these approaches requires an
investment of some combination of time, resources, and
expertise over that required to generate animals and plants
with homologously recombined alleles. The most commonly
employed method, the secondary transfection of homologously
recombined ES cell clones with a recombinase expression
vector, additionally requires extended culture time that
may decrease the potential of the clones to enter the
germline.

In principle, marker excision would be substantially
simplified through the use of ES cells containing
recombinase nucleic acid constructs that were expressed in
the germline, but not to an appreciable extent in the ES
cells themselves or somatic tissues of animals and plants.
The lack of ES cell expression would mean that targeting
vectors containing selectable markers flanked by
recombinase target sites could be used to isolate
homologous recombinants without fear that the marker would
be excised during culture. Robust recombinase expression
in gametes would mean that the marker would be excised in
at least some of the progeny of ES cell chimeras. Only a
single step would be required to isolate subtle mutations
and, if two different recombinase systems were employed,
conditional and tissue-specific alleles could be produced
with similar improvements in efficiency. A
germline-specific recombinase nucleic acid construct could
also be used to deliver recombined target nucleic acid
constructs to the early embryo (Lewandoski et al. (1997)
Curr Biol 7:148-151), so long as the recombined target was
not detrimental to the terminal stages of spermatogenesis.

Previous reports have shown that expression of nucleic
acid constructs containing the proximal promoter of the
mouse protamine 1 (mP1) locus is restricted to haploid
spermatids in mature mice (Peschon et al. (1987) Proc Natl
Acad Sci USA 84:5316-5319; Behringer et al. (1988) Proc
Natl Acad Sci USA 85:2648-2652), although low levels of
ectopic expression may occur in some mature tissues
(Behringer et al. (1988) Proc Natl Acad Sci USA
85:2648-2652). Inclusion of the mP1 promoter does not
guarantee expression in the male germline, however, for
although nucleic acid constructs containing the mP1
promoter and the SV40 T-antigen coding sequence were
transcribed, the message was not translated at detectable
levels in spermatids (Behringer et al. (1988) Proc Natl
Acad Sci USA 85:2648-2652).

Accordingly, there is a need in the art for methods to 
modulate expression of recombined target nucleic acid 
sequences in the early embryo. In addition, there is a
need in the art for tissue-specific and conditional 
recombinatory tools to create transgenic animals and 
plants. These and other needs in the art are addressed by 
the present invention. 



BRIEF DESCRIPTION OF THE INVENTION

The present invention meets the need in the art for
modulating expression of recombined target nucleic acid
sequences in the early embryo. The present invention
further meets the need in the art for tissue-specific and
conditional recombinatory tools to create transgenic
animals and plants. Thus, in accordance with the present
invention, it has been discovered that nucleic acid
constructs encoding a germline-specific promoter
operatively associated with a recombinase coding sequence
lead to efficient recombination of a target nucleic acid
construct in the male germline, but not in other tissues.
This suggests that such nucleic acid constructs could be
used for the efficient production of embryos bearing
conditional, genetically lethal alleles. It has
additionally been discovered that ES cell lines generated
from one of these transgenic lines could be used in
combination with targeting vectors that contained
loxP-flanked selectable markers to isolate homologous
recombinants containing the marker and functional loxP
sites.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 illustrates a schematic of P2Bc and P2Br
alleles. The positions of the PCR primers used (5' P and
3' P) are indicated on the diagrams of the P2Bc and P2Br
alleles.

Figure 2 depicts the targeting of the hoxb-1 locus in
ProCre ES cells using a targeting vector that contains a
loxP-flanked selectable marker. Top, schematic of the
wild-type hoxb-1 locus showing the positions of the two
exons (open boxes), the position of a 5' NruI site and
flanking BamHI restriction endonuclease sites, and PCR
primers (triangles) that amplify a 204 bp product from the
wild-type allele that includes the NruI site. Middle, the
predicted organization of the homologously recombined hoxb-1
allele in which a neomycin cassette (NEO), flanked by loxP
sites (L), has been inserted into the NruI site shown in
the top diagram. The insertion creates a novel BamHI site
and the same PCR primers now amplify a 1600 bp product.
Bottom, the predicted structure of the recombined allele
shown in the middle panel after Cre-mediated excision of
the neomycin cassette to leave a single loxP site in place
of the NruI site of the wild-type allele. Amplification
with the same primers now yields a 268 bp product.
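
The three product sizes given for Figure 2 are enough to tell the alleles apart on a gel. The following is a minimal sketch of that bookkeeping, not part of the disclosed method: the function names, the allele labels, and the 15 bp sizing tolerance are illustrative assumptions, and only the 204, 1600, and 268 bp values come from the figure description.

```python
# Expected PCR product sizes (bp) for the three hoxb-1 alleles described
# for Figure 2. The allele labels are illustrative, not from the source.
EXPECTED_PRODUCTS_BP = {
    "wild-type": 204,
    "targeted (loxP-NEO-loxP)": 1600,
    "excised (single loxP)": 268,
}


def classify_band(observed_bp, tolerance_bp=15):
    """Return the allele whose expected product lies within the tolerance.

    The tolerance is an assumed gel-sizing error; None is returned when
    no allele, or more than one allele, matches the observed band.
    """
    matches = [
        name
        for name, size in EXPECTED_PRODUCTS_BP.items()
        if abs(observed_bp - size) <= tolerance_bp
    ]
    return matches[0] if len(matches) == 1 else None


def residual_insert_bp():
    """Size difference between the excised and wild-type products."""
    return (
        EXPECTED_PRODUCTS_BP["excised (single loxP)"]
        - EXPECTED_PRODUCTS_BP["wild-type"]
    )


def excised_segment_bp():
    """Size difference between the targeted and excised products."""
    return (
        EXPECTED_PRODUCTS_BP["targeted (loxP-NEO-loxP)"]
        - EXPECTED_PRODUCTS_BP["excised (single loxP)"]
    )
```

By this arithmetic the excised allele carries 64 bp more than the wild-type allele between the primers, and excision removes 1332 bp from the targeted allele.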

DETAILED DESCRIPTION OF THE INVENTION

In accordance with the present invention, there are
provided nucleic acid constructs comprising a
germline-specific promoter operatively associated with a
recombinase coding sequence.



As used herein, the term "promoter" refers to a
specific nucleotide sequence recognized by RNA polymerase,
the enzyme that initiates RNA synthesis. The promoter
sequence is the site at which transcription can be
specifically initiated under proper conditions. The
recombinase nucleic acid(s), operatively linked to the
suitable promoter, is (are) introduced into the cells of a
suitable host, wherein expression of the recombinase
nucleic acid(s) is (are) controlled by the promoter.

Germline-specific promoters contemplated for use in
the practice of the present invention include the protamine
1 gene promoter, the protamine 2 gene promoter, the
spermatid-specific promoter from the c-kit gene (Albanesi
et al. (1996) Development 122(4):1291-1302), the
sperm-specific promoter from angiotensin-converting enzyme
(Howard et al. (1993) Mol Cell Biol 13(1):18-27; Zhou et
al. (1995) Dev Genet 16(2):201-209), the oocyte-specific
promoter from the ZP1 gene, the oocyte-specific promoter from
the ZP2 gene, the oocyte-specific promoter from the ZP3 gene
(Schickler et al. (1992) Mol Cell Biol 12(1):120-127), and
the like.

In addition to the above-described germline-specific
promoters, tissue-specific promoters specific to plants are
also contemplated for use in the practice of the present
invention, including, for example, the LAT52 gene promoter
from tomato, the LAT56 gene promoter from tomato, the LAT59
gene promoter from tomato (Eyal et al. (1995) Plant Cell
7(3):373-384), the pollen-specific promoter of the Brassica
S locus glycoprotein gene (Dzelzkalns et al. (1993) Plant
Cell 5(8):855-863), the pollen-specific promoter of the
NTP303 gene (Weterings et al. (1995) Plant J 8(1):55-63),
and the like.

Recombinases contemplated for use in the practice of
the present invention include Cre recombinase, FLP
recombinase, the R gene product of Zygosaccharomyces
(Onouchi et al. (1995) Mol Gen Genet 247(6):653-660), and
the like.

Presently preferred constructs contemplated for use in
the practice of the present invention include ProCre
(comprising the protamine 1 gene promoter operatively
associated with Cre recombinase), ProFLP (comprising the
protamine 1 gene promoter operatively associated with FLP
recombinase), ProR (comprising the protamine 1 gene
promoter operatively associated with the R gene product of
Zygosaccharomyces), and the like.

In accordance with another embodiment of the present
invention, there are provided nucleic acid constructs
comprising a conditional promoter or a tissue-specific
promoter operatively associated with a recombinase coding
sequence.

Promoters contemplated for control of expression of
recombinase nucleic acid(s) employed in accordance with
this aspect of the present invention include inducible
(e.g., minimal CMV promoter, minimal TK promoter, modified
MMLV LTR), constitutive (e.g., chicken β-actin promoter,
MMLV LTR (non-modified), DHFR), and/or tissue-specific
promoters.

Conditional promoters contemplated for use in the
practice of the present invention comprise transcription
regulatory regions that function maximally to promote
transcription of mRNA under inducing conditions. Examples
of suitable inducible promoters include DNA sequences
corresponding to: the E. coli lac operator responsive to
IPTG (see Nakamura et al., Cell, 18:1109-1117, 1979); the
metallothionein promoter metal-regulatory elements
responsive to heavy-metal (e.g., zinc) induction (see Evans
et al., U.S. Patent No. 4,870,009); the phage T7lac
promoter responsive to IPTG (see Studier et al., Meth.
Enzymol., 185:60-89, 1990; and U.S. Patent No. 4,952,496);
the heat-shock promoter; the TK minimal promoter; the CMV
minimal promoter; a synthetic promoter; and the like.

Exemplary constitutive promoters contemplated for use
in the practice of the present invention include the CMV
promoter, the SV40 promoter, the DHFR promoter, the mouse
mammary tumor virus (MMTV) steroid-inducible promoter,
Moloney murine leukemia virus (MMLV) promoter, elongation
factor 1α (EF1α) promoter, albumin promoter, APO A1
promoter, cyclic AMP dependent kinase II (CaMKII) promoter,
keratin promoter, CD3 promoter, immunoglobulin light or
heavy chain promoters, neurofilament promoter, neuron
specific enolase promoter, L7 promoter, CD2 promoter,
myosin light chain kinase promoter, HOX gene promoter,
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD
promoter, MYF5 promoter, phosphoglycerokinase (PGK)
promoter, Stfl promoter, Low Density Lipoprotein (LDL)
promoter, chicken β-actin promoter (used in conjunction
with ecdysone response element), and the like.

As readily understood by those of skill in the art,
the term "tissue specific" refers to the substantially
exclusive initiation of transcription in the tissue from
which a particular promoter, which drives expression of a
given gene, is derived (e.g., expressed only in T-cells,
endothelial cells, smooth muscle cells, and the like).
Exemplary tissue specific promoters contemplated for use in
the practice of the present invention include the GH
promoter, the NSE promoter, the GFAP promoter,
neurotransmitter promoters (e.g., tyrosine hydroxylase, TH,
choline acetyltransferase, ChAT, and the like), promoters
for neurotropic factors (e.g., a nerve growth factor
promoter, NT-3, BDNF promoters, and the like), and so on.

In accordance with yet another embodiment of the 
present invention, there are provided embryonic stem cells 
containing a nucleic acid construct as described herein. 

As readily understood by those of skill in the art,
the above-described constructs can be introduced into a
variety of animal species, such as, for example, mouse,
rat, rabbits, swine, ruminants (sheep, goats and cattle),
humans, poultry, fish, and the like. Transgenic
amphibians, insects, nematodes, and the like, are also
contemplated. Members of the plant kingdom, such as, for
example, transgenic mono- and dicotyledonous species,
including important crop plants, e.g., wheat, rice, maize,
soybean, potato, cotton, alfalfa, and the like, are also
contemplated.

For example, pluripotential ES cells can be derived
from early pre-implantation embryos; preferably the ova are
harvested between the eight-cell and blastocyst stages. ES
cells are maintained in culture long enough to permit
integration of the promoter-recombinase nucleic acid
construct(s). The cells are then either injected into a
host blastocyst, i.e., the blastocoel of the host
blastocyst, or co-cultured with eight-cell to morula-stage
ova, i.e., zona-free morula, so that transfected ES cells
are preferentially incorporated into the inner cell mass of
the developing embryo. With blastocyst injection,
transgenic offspring are termed "chimeric," as some of
their cells are derived from the host blastocyst and some
from the transfected ES cells. The host embryos are
transferred into intermediate hosts or surrogate females for
continuous development.

The transformation procedure for plants usually relies
on the transfer of a transgene carrying a particular
promoter construct via the soil bacterium Agrobacterium
tumefaciens. Transformation vectors for this procedure are
derived from the T-DNA of A. tumefaciens, and transgenes
are stably incorporated into the nuclear genome. The
activity of the transgenes can then be monitored in the
regenerated plants under different conditions. In this
way, many promoter elements that are involved in complex
regulatory pathways such as light responsiveness or tissue
specificity have been defined.



Alternatively, direct (i.e., vectorless) gene
transfer systems are also contemplated, including chemical
methods, electroporation, microinjection, biolistics, and
the like. Protoplasts isolated from the plants can be
obtained by treatment with cell wall degrading enzymes.
DNA can be introduced into plant protoplasts by a number of
physical techniques including electroporation and
polyethylene glycol treatment in the presence of MgCl2.
The method of choice for rapid promoter analyses in plants
is the biolistic method. This technique involves the
delivery of the particular DNA construct into plant cells
by microprojectiles, i.e., tungsten or gold particles onto
which the nucleic acid(s) have been coated or
precipitated. This method is not
limited to any particular plant species or tissue type.
Preferably, this method would allow quantitative analysis
of transformation if appropriate selectable markers are
included.

In a preferred embodiment, the genome of embryonic
stem cells according to the invention comprises a
transcriptionally active selectable marker flanked by two
recombination target sites. It is especially preferred
that the recombinase encoded by the recombinase coding
sequence operatively associated with a germline-specific
promoter is selective for the recombination target sites
flanking said selectable marker.

Optionally, embryonic stem cells according to the
invention may further comprise one or more of:

a nucleic acid fragment flanked by two recombination
target sites, wherein said recombination target sites are
different than the recombination target sites which flank
said selectable marker,

a nucleic acid construct comprising a conditional
promoter operatively associated with a recombinase coding
sequence,

a second nucleic acid construct comprising a
tissue-specific promoter operatively associated with a second
recombinase coding sequence, or the like. Preferably, the
second recombinase coding sequence will be different than
the first recombinase coding sequence.

The ability to select and maintain nucleic acid
constructs in the host cell is an important aspect of an
expression system. The most common type of selectable
marker incorporated in the nucleic acid construct is an
antibiotic resistance element allowing selection with
ampicillin, kanamycin, neomycin, tetracycline, hygromycin,
puromycin, blastophycin, and the like. Other approaches
employ specially constructed host cells which require the
selectable marker for survival. Such selectable markers
include the valine tRNA synthetase, val S, the
single-stranded DNA binding protein, ssb, thymidine kinase,
or the like. Alternatively, naturally occurring partition
systems that maintain copy number and select against
plasmid loss are also contemplated. An example is the
incorporation of the parB locus. Other selectable markers
include HPRT and the like.

Selectable markers specific for plants include the
gus A (uid A) gene, the bar gene, phosphinothricin and the like.

In accordance with still another embodiment of the
present invention, there are provided methods for excision
of the transcriptionally active selectable marker from the
above-described embryonic stem cells, said method
comprising:

passaging the genome derived from said embryonic stem
cells through gametogenesis (i.e., spermatogenesis or
oogenesis).
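
The logic of this method, in which the marker survives selection in culture and is lost only once the genome passes through a stage where the germline-specific promoter is active, can be sketched as a toy model. Everything here is an illustrative assumption: the stage names are hypothetical labels (only "spermatid" reflects the protamine 1 expression pattern described in this document), and excision is treated as all-or-none, whereas the text only expects excision in at least some progeny.

```python
# Stages in which the recombinase is assumed to be expressed. Only
# "spermatid" reflects the protamine 1 promoter described in the text;
# all other stage names used below are hypothetical labels.
RECOMBINASE_EXPRESSING_STAGES = {"spermatid"}


def marker_retained(stages_traversed):
    """Return True if the genome never passed through an expressing stage.

    Simplification: excision is modeled as complete whenever an
    expressing stage is traversed.
    """
    return not any(
        stage in RECOMBINASE_EXPRESSING_STAGES for stage in stages_traversed
    )


# Selection in culture: the marker must survive so clones can be isolated.
es_culture = ["ES cell", "selected ES clone"]

# Passage through the male germline of a chimera.
through_male_germline = [
    "ES cell",
    "chimera",
    "spermatocyte",
    "spermatid",
    "progeny",
]

print(marker_retained(es_culture))
print(marker_retained(through_male_germline))
```

The first call reports that the marker is retained and the second that it has been excised, which mirrors the single-step isolation described above.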

Excision of the marker as contemplated herein can cause
a variety of end results, e.g., deletion of the marker or
a nucleic acid sequence, gain of function or loss of
function, replacement of function, and the like, as well as
modulation of any one or more of these results.

Functions which are contemplated to be manipulated
include regulating body size and growth rate, including
recombining gene constructs which contain various growth
hormone gene sequences. Other productivity traits that are
targets include altering the properties or proportions of
caseins, lactose, or butterfat in milk, increased
resistance to viral and bacterial diseases (i.e.,
"constitutive immunity" or germ-line transmission of
specific, recombined antibody genes), more efficient wool
production, and the like. Other functions which are
contemplated to be modulated include development of lines
of transgenic animals and plants for use in directing
expression of transgenes encoding biologically active human
proteins.

Agronomic traits which are contemplated to be
modulated by use of the present invention include tolerance
to biotic and abiotic stress, increased resistance to
herbicides, pest damage, and viral, bacterial, and fungal
diseases, improvement of crop quality (i.e., increase in
nutritional value of food and feed), reduction of
post-harvest losses, improvement of suitability and enlargement
of the spectrum for processing (i.e., altered quantity and
composition of endogenous properties, production of new
compounds of plant or non-plant origin such as biopolymers
or pharmaceutical substances).

In accordance with a still further embodiment of the
present invention, there are provided methods for the
production of recombinant alleles, said method comprising:

introducing a nucleic acid fragment flanked by at
least two recombination target sites into embryonic stem
cells as described herein, and

passaging the genome derived from said embryonic stem
cells through gametogenesis.

As readily recognized by those of skill in the art,
nucleic acid fragments can be introduced into ES cells by
a variety of techniques, e.g., by homologous recombination,
random insertion, retroviral insertion, site
specific-mediated recombination, and the like.

Nucleic acid fragments contemplated for use herein
include fragments containing an essential portion of a gene
of interest.

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
production of recombinant alleles, said method comprising: 

introducing at least one recombinase responsive construct
into embryonic stem cells as described herein,

wherein said construct(s) comprise(s) a nucleic
acid fragment and a selectable marker,

wherein said selectable marker is flanked by a
first pair of recombination target sites, and

wherein said nucleic acid fragment is flanked by
a second pair of recombination target sites, and

passaging the genome derived from said embryonic stem cells
through gametogenesis.



In a presently preferred aspect, the first pair of
recombination target sites is recognized by a recombinase
which is expressed under the control of a germline-specific
promoter and said second pair of recombination target sites
is recognized by a recombinase which is expressed under the
control of a conditional promoter or a tissue specific
promoter.
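
The two-pair arrangement can be illustrated with a small sketch in which an allele is a list of named elements and each recombinase deletes whatever lies between the first two copies of its own target site. This is a simplified model under stated assumptions: the allele layout, the element names, and the choice of Cre as the germline recombinase and FLP as the conditional one are illustrative, and site orientation is ignored (sites are assumed to be directly repeated). The loxP/Cre and FRT/FLP pairings are standard.

```python
# Target site recognized by each recombinase.
TARGET_SITE = {"Cre": "loxP", "FLP": "FRT"}


def recombine(allele, recombinase):
    """Delete the segment between the first two copies of the target site.

    One copy of the site is left behind. The allele is returned unchanged
    (as a new list) if fewer than two copies of the site are present.
    """
    site = TARGET_SITE[recombinase]
    positions = [i for i, element in enumerate(allele) if element == site]
    if len(positions) < 2:
        return list(allele)
    first, second = positions[0], positions[1]
    return list(allele[: first + 1]) + list(allele[second + 1 :])


# Hypothetical targeted allele: the marker is flanked by the first pair
# of sites and an essential fragment by the second pair.
targeted = [
    "5' arm", "loxP", "NEO", "loxP",
    "FRT", "essential exon", "FRT", "3' arm",
]

# Step 1: the germline-expressed recombinase removes only the marker.
after_germline = recombine(targeted, "Cre")

# Step 2: the conditional or tissue-specific recombinase later removes
# the essential fragment, inactivating the allele where it is expressed.
after_conditional = recombine(after_germline, "FLP")

print(after_germline)
print(after_conditional)
```

Because each recombinase acts only on its own sites, marker removal in the germline leaves the second pair intact for later conditional or tissue-specific inactivation.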

Optionally, the embryonic stem cells employed herein
can further comprise a second nucleic acid construct
selected from constructs comprising a conditional promoter
operatively associated with a recombinase coding sequence,
a construct comprising a tissue-specific promoter
operatively associated with a recombinase coding sequence,
and the like.

In accordance with still another embodiment of the
present invention, there are provided methods for the
conditional assembly of functional gene(s) for expression
in eukaryotic cells by recombination of individual inactive
gene segments from one or more gene(s) of interest,

wherein each of said segments contains at least one
recombination target site, and

wherein at least one of said segments contains at
least two recombination target sites,

said method comprising:

introducing said individual inactive gene
segments into an embryonic stem cell as described
herein, thereby providing a DNA which encodes a
functional gene of interest, the expression product of
which is biologically active, upon passage of the
genome derived from said stem cells through
gametogenesis.

For assembly of functional genes from inactive gene 
segments, see, for example, US Patent No. 5,654,182, 
incorporated herein by reference in its entirety. 

In accordance with a still further embodiment of the
present invention, there are provided methods for the
generation of recombinant livestock, said method
comprising:

combining embryonic stem cells that include a nucleic
acid construct according to the invention with host
pluripotential ES cells derived from early pre-implantation
embryos, and

introducing these combined embryos into a host female
and allowing the derived embryos to come to term.

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
generation of recombinant plants, said method comprising 
transforming plant zygotes with nucleic acid constructs 
according to the invention and allowing the zygote to 
develop. 

The objective of the current work with ProCre nucleic
acid constructs was to determine the potential of
germline-specific promoters to implement efficient
approaches utilizing site-specific recombinases to generate
an array of sophisticated mutations in mammals and plants.
The data show that it is possible to create recombinase
nucleic acid constructs that are expressed at high levels
in the germline but not to a functionally significant
extent in either ES cells or embryonic or adult somatic
tissues. Homologous recombinants with a selectable marker
can be isolated in ES cells that contain
promoter-recombinase nucleic acid constructs. Transgenic
animals and plants bearing the promoter-recombinase nucleic
acid constructs and a target allele transmit the recombined
target to their progeny at high frequencies. These results
establish the principle that mammals and plants containing
loci that have been homologously recombined and then
subsequently site-specifically recombined can be generated
simply by using ES cells with a suitable recombinase
nucleic acid construct for the initial targeting. By this
mechanism, alleles containing a single recombinase target
site and a mutation of interest can be produced in the
progeny of ES cell chimeras without any investment of time,
expertise, or resources over that required to create an
allele that still contains a selectable marker. The
paradigm has obvious utility in the production of subtle
and conditional mutations that require generation of
alleles with minimal structural alterations. Because the
presence and transcriptional activity of selectable markers
can contribute to phenotypes in an unanticipated and
unwanted manner (Fiering et al. (1995) Genes Dev
9:2203-2213; Olson et al. (1996) Cell 85:1-4), the
approach will also be useful for generating null alleles.

Expression of the endogenous mP1 locus (Hecht et al.
(1986) Exp Cell Res 164:183-190) and mP1-driven nucleic
acid constructs (Behringer et al. (1988) Proc Natl Acad Sci
USA 85:2648-2652; Braun et al. (1989) Nature 337:373-376;
Zambrowicz et al. (1993) Proc Natl Acad Sci USA
90:5071-5075) is restricted to haploid spermatids.
Expression of mP1 nucleic acid constructs
typically begins at haploid stages, and both RNA (Caldwell
and Handel (1991) Proc Natl Acad Sci USA 88:2407-2411)
and proteins (Braun et al. (1989) Nature 337:373-376)
diffuse through the spermatogenic syncytium. The result is
a highly efficient recombination of target alleles and the
segregation of recombinase and target nucleic acid
constructs in the first generation.

Cre-mediated recombination proved to be highly
testis-specific in ProCre mice. It is clear that the nucleic acid
constructs are not expressed in the inner cell mass or in
other early embryonic tissues. Cells from pre-implantation
embryos intermingle extensively and the embryo as a whole
is derived from a small number of cells (Beddington et al.
(1989) Development 106:37-46; Soriano and Jaenisch (1986)
Cell 46:19-29). If ProCre nucleic acid constructs
recombined target sequences during pre-implantation stages,
at least a few percent of the cells in many tissues would
contain the P2Br allele, and Southern and PCR analyses
showed that this was not the case. The ectopic Cre
activity seen in some ProCre strains probably resulted from
low levels of recombinase expression in later embryos or
mature tissues, a finding consistent with the expression
patterns of other mP1-driven nucleic acid constructs.
Northern analyses have failed to reveal the expression of
mP1-containing nucleic acid constructs in a variety of
mature tissues (Peschon et al. (1987) Proc Natl Acad Sci
USA 84:5316-5319; Behringer et al. (1988) Proc Natl Acad
Sci USA 85:2648-2652; Peschon et al. (1989) Ann N Y Acad
Sci 564:186-197; Zambrowicz et al. (1993) Proc Natl Acad
Sci USA 90:5071-5075), but nucleic acid constructs
containing the mP1 promoter and the SV40 T-antigen led to
the consistent development of tumors of the petrosal bone
and right cardiac atrium (Behringer et al. (1988) Proc Natl
Acad Sci USA 85:2648-2652).

PCR assays are a very sensitive test of
whether sufficient levels of Cre protein were produced to
effect recombination. Importantly, they measured the
cumulative level of recombination, for events that occurred
at any stage of development are likely to have been
propagated to, and might be amplified in, descendant
populations. The highest level of ectopic recombination
was that observed in cardiac ventricular tissue of
one strain, which generated a signal approximately equivalent to
that expected if the ratio between recombined and
unrecombined alleles were 1:10^4. The activities observed
in other strains were considerably lower than this, and one
strain did not show any ectopic activity. None of the
strains showed evidence of recombination in the cardiac
atria and the petrosal bone was not examined. These assays
did not rule out the possibility that higher levels of
recombination occur in tissues that were not examined or
that the low levels of recombination observed in some
tissues reflected high levels of recombination in some
component cell population.

These low levels of ectopic activity suggest that
mP1-driven recombinase nucleic acid constructs could be
used for the production of embryos containing genetically
lethal alleles. Some alleles created by homologous
recombination in ES cells will prove to be lethal in
heterozygotes, as was the case for an mRNA editing mutation
of the GluR2 glutamate receptor subunit (Brusa et al.
(1995) Science 270:1677-1680). Germline transmission would
be restricted to rare chimeras in which the level of
chimerism was low enough in tissues affected by the
mutation to allow survival and high enough in the germline
to allow transmission. This problem could be circumvented
by creating recombinase-conditional mutations in ES cells
bearing mP1-recombinase nucleic acid constructs, or by
making the same mutations in standard ES cells and then
introducing the mP1-recombinase nucleic acid construct by
breeding. So long as the recombined version of the allele
did not adversely impact terminal stages of
spermatogenesis, embryos containing the recombined allele
could be efficiently produced. Embryos containing
recombined nucleic acid constructs can also be produced
through the activity of Cre nucleic acid constructs that
are expressed during early embryogenesis from the human
cytomegalovirus minimal promoter (Schwenk et al. (1995)
Nucleic Acids Res 23:5080-5081), the adenovirus EIIa
promoter (Lakso et al. (1992) Proc Natl Acad Sci USA
89:6232-6236), or the ZP3 promoter (Lewandoski et al.
(1997) Curr Biol 7:148-151). ProCre and ZP3 nucleic acid
constructs have the advantage of delivering a recombined
allele to the zygote, guaranteeing that all cells in the
derived embryos will contain the allele.

ProCre ES cells are but one of many different kinds of
recombinase-bearing ES cells that could significantly
shorten the time and effort required for a wide variety of
genetic manipulations in mice. The most obvious of these
are complementary ProFLP ES cells in which the FLP
recombinase was derived from S. cerevisiae (Broach and Hicks
(1980) Cell 21:501-508) or another species (Kuhn et al.
(1995) Science 269:1427-1429). Conceptually distinct from
these but perhaps as generically useful would be ES cells
bearing inducible recombinase nucleic acid constructs that
would facilitate temporal control of recombinase expression
in ES cells, chimeras, and their progeny to generate
site-specifically recombined alleles (Araki et al. (1992)
J Mol Biol 225:25-37; No et al. (1996) Proc Natl Acad Sci
USA 93:3346-3351; Logie and Stewart (1995) Proc Natl Acad
Sci USA 92:5940-5944; Feil et al. (1996) Proc Natl Acad
Sci USA 93:10887-10890). Finally, fusion genes that led
to recombinase expression in specific tissues could be used
to address specific research objectives.

The invention will now be described in greater detail
by reference to the following non-limiting examples.

Example 1 
Mammalian DNA Constructs 

A 652 bp fragment of the mP1 promoter (SEQ ID NO:1;
Peschon et al. (1989) Annals of the New York Academy of
Sciences 564:186-197) was isolated by PCR using PCR primers
(SEQ ID NOs:2 and 3) and genomic DNA templates from CCE ES
cells (Robertson et al. (1986) Nature 323:445-448). This
fragment was fused to a modified Cre coding sequence (SEQ
ID NO:4) which contains a consensus translation start site
(Kozak (1986) Cell 44:283-292), 11 codons for a human c-myc
epitope (Evan et al. (1985) Mol Cell Biol 5:3610-3616),
7 codons for a minimal SV40 nuclear localization signal
(Kalderon et al. (1984) Cell 39:499-509) and the
polyadenylation signal from pIC-Cre in the plasmid pOG304M
(SEQ ID NO:5). The Cre expression plasmid pOG231 was
prepared by fusing a modified Cre coding sequence from
pIC-Cre (Gu et al. (1993) Cell 73:1155-1164), and
containing the same translation start and nuclear
localization signal, to the synthetic intron and CMV
promoter of pOG44 (O'Gorman et al. (1991) Science
251:1351-1355).

A plasmid, pOG277 (SEQ ID NO:7), containing a
loxP-flanked neomycin cassette was prepared by inserting a
wild-type loxP site (SEQ ID NO:8; Hoess et al. (1982) Proc
Natl Acad Sci USA 79:3398-402) into pBSKS (Stratagene)
and then cloning the neomycin expression cassette from
pMC1neo-polyA (Thomas et al. (1987) Cell 51:503-512)
between iterations of this loxP site. The hoxb-1
targeting construct consisted of the PGK-TK cassette from
pPNT (Tybulewicz et al. (1991) Cell 65:1153-63) and 1.4 kb
and 10.2 kb of sequences 5' and 3' to an Nru I site 800 bp
5' to the hoxb-1 transcriptional start site isolated from
a 129 strain genomic library (Stratagene). The
loxP-flanked neo cassette from pOG277 was inserted into the
Nru I site. The pOG277 neomycin cassette and a β-GAL
sequence were inserted into the first exon of the large
subunit of RNA polymerase II (RP2) (Ahearn et al. (1987)
J. Biol. Chem. 262:10695-10705) to create the P2Bc allele
(Figure 1). Cre-mediated recombination of the P2Bc allele
results in the deletion of the neomycin cassette (Neo) of
P2Bc that is flanked by two loxP sites, leaving a single
loxP site and fusing the β-Gal coding sequence to the
initial codons of the RNA polymerase II coding sequence.
Recombination increases the size of a Pst I fragment
recognized by the RP2 probe, which is external to the
targeting vector used and is indicated by the shaded box
below each allele.

Example 2 
Production of transgenic mice 

Fertilized oocytes obtained from matings of 129/SvJae
(Simpson et al. (1997) Nat Genet 16:19-27) and BALB/c X
C57BL/6 F1 mice were used for pronuclear injections of the
Protamine-Cre fusion gene from pOG304M according to
standard protocols (Hogan et al. Manipulating the Mouse
Embryo: The Manual, Cold Spring Harbor Press (1994), pg.
497).

Production of ES cells and homologous recombinants:
Heterozygous ProCre 129/SvJae males were mated to
129/SvEms-+Ter?/J females (Simpson et al. (1997) Nat Genet
16:19-27) to produce blastocysts that were cultured
according to standard protocols (Robertson (1987)
Teratocarcinomas and embryonic stem cells, a practical
approach, eds. E.J. Robertson (IRL Press), pp. 71-112).
The sex (King et al. (1994) Genomics 24:159-68) and ProCre
status of each line were determined by PCR assays.

Molecular analyses: Tail biopsy genomic DNA was used for
hybridization assays or PCR assays to identify ProCre and
P2Bc/r mice. PCR reactions for the detection of ectopic
Cre activity used 100 ng of genomic DNA as a template to
amplify a P2Br-specific product using a 5' primer from the
RP2 promoter and a 3' primer from the β-GAL coding sequence
(Figure 1). Thirty cycles of amplification were done in a
total volume of 100 µl using 300 ng of each primer, 3 mM
MgCl2, 1.5 units of Taq polymerase, and an annealing
temperature of 60 °C. Southern blots of reaction products
were hybridized with a probe specific for the P2Br reaction
product.

Example 3
ProCre Nucleic Acid Constructs Efficiently Recombine
Target Alleles

A total of nine founder animals with ProCre nucleic
acid constructs were obtained from injections of a
protamine-Cre fusion gene. Two lines were derived from
injections of 129/SvJae (Simpson et al. (1997) Nat Genet
16:19-27) embryos, and seven from injections of CB6F2
embryos. The 129/SvJae lines and three randomly selected
hybrid lines were examined in detail. To determine whether
ProCre nucleic acid constructs would efficiently recombine
a target allele, males were generated that contained a
ProCre nucleic acid construct and a target for Cre-mediated
recombination. This "P2Bc" (Pol II, β-GAL, conditional)
target (Figure 1) was created using homologous
recombination in ES cells to insert a loxP-flanked neomycin
cassette and a β-GAL coding sequence into the first exon of
the locus coding for the large subunit of RNA
polymerase II. Cre-mediated recombination of the loxP
sites was expected to delete the intercalated sequences,
creating a "P2Br" allele (Pol II, β-Gal, recombined).

These males were mated to wild-type females and the
resulting progeny were examined by Southern blotting to
determine if they inherited the P2Bc or the P2Br allele,
and to additionally determine the segregation pattern of
ProCre nucleic acid constructs and P2Br alleles. A Southern
blot of Pst I digested tail biopsy DNAs from a +/P2Bc,
+/ProCre male (sire) and four of his progeny by a wild-type
female was probed with an RP2 probe (top) and then reprobed
with a Cre probe (bottom). The large majority of transmitted
target alleles were Cre-recombined P2Br alleles (Table 1).
ProCre nucleic acid constructs and recombined target
alleles segregated independently in the first generation;
approximately 50% of mice that inherited a P2Br allele also
inherited their male parent's ProCre nucleic acid
construct. All RP2 mutant alleles in the progeny were
P2Br, and some progeny inherited a P2Br allele without
inheriting a ProCre nucleic acid construct. Mouse 4 did not
contain a ProCre nucleic acid construct and is homozygous
wild-type at the RP2 locus. These data establish that
ProCre nucleic acid constructs efficiently recombine the
P2Bc allele in the male germline and that the recombined
P2Br alleles and ProCre nucleic acid constructs segregate
in the first generation. Because significantly more than
25% of the progeny inherited recombined target alleles,
recombination either occurred during diploid stages of
spermatogenesis or Cre generated during haploid stages of
spermatogenesis was distributed among spermatids through
cytoplasmic bridges (Braun et al. (1989) Nature
337:373-376), effecting recombination in spermatids that
did not themselves contain a ProCre nucleic acid construct.
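
The 25% threshold used in this argument follows from simple Mendelian arithmetic, which the short sketch below enumerates. It is an illustrative model only: it assumes a sire heterozygous for an unlinked ProCre construct and for the P2Bc target, equal transmission of the four gamete classes, and complete recombination whenever a target allele is exposed to Cre, so the values are upper bounds. The function name and the `cre_shared` flag are hypothetical and are not part of the original work.

```python
from fractions import Fraction
from itertools import product


def recombined_progeny_fraction(cre_shared: bool) -> Fraction:
    """Expected fraction of all progeny inheriting a recombined (P2Br) allele.

    cre_shared=False models recombination that occurs only in haploid
    spermatids that themselves carry ProCre.
    cre_shared=True models diploid-stage recombination, or Cre that is
    shared among spermatids, so every target allele is exposed to Cre.
    """
    total = Fraction(0)
    # Four equally likely gamete classes from a +/ProCre, +/P2Bc sire.
    for has_procre, has_target in product([True, False], repeat=2):
        exposed_to_cre = cre_shared or has_procre
        if has_target and exposed_to_cre:
            total += Fraction(1, 4)
    return total


print(recombined_progeny_fraction(cre_shared=False))  # 1/4
print(recombined_progeny_fraction(cre_shared=True))   # 1/2
```

Under these assumptions, a strictly cell-autonomous haploid mechanism cannot yield more than one quarter of progeny with a recombined allele, which is why an observed frequency well above 25% points to one of the two alternatives named in the text.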

The progeny of matings between ProCre males and +/P2Bc
females were also examined to determine if male gametes
from ProCre mice delivered enough Cre to zygotes to effect
Cre-mediated recombination of a target sequence. Of 96
progeny examined by Southern blotting, none contained a
Cre-recombined P2Br allele.

It has also been discovered that a loxP-flanked neo
cassette in the glutamate receptor R6 subunit locus is
efficiently recombined by ProCre nucleic acid constructs in
mice.

Example 4 

ProCre Nucleic Acid Construct Expression is Highly
Tissue-Specific

Genomic DNAs from ten different tissues of five- to
seven-week old males that contained both a ProCre nucleic
acid construct and a P2Bc target allele were analyzed in
Southern blots. Southern blots were prepared of Pst I
digested DNA from testes (T) and one other tissue (K,
kidney; B, brain; S, spleen) of males heterozygous for one
of four ProCre nucleic acid constructs and the P2Bc allele.
Testis DNA from each male shows a P2Br allele signal, in
addition to those generated by the wild-type RP2 (WT) and
P2Bc alleles. Other tissues show only the WT and P2Bc
signals. Only the testis samples showed signal indicating
Cre-mediated recombination of the target. The intensity of
the P2Br signal relative to that of the wild-type allele
ranged from 10% to 22% for different ProCre strains and did
not correlate with the ProCre nucleic acid construct copy
number. The copy number of ProCre nucleic acid constructs
varied among lines showing similar levels of recombination
in testis. For example, restriction patterns and
densitometric analyses showed that line 58 contained a
single copy of the ProCre nucleic acid construct, yet
showed virtually the same testis recombination signal as
a line containing more than 100 copies. This variability is
similar to results obtained with other mP1 promoter-driven
nucleic acid constructs (Peschon et al. (1987) Proc Natl
Acad Sci USA 84:5316-5319; Zambrowicz et al. (1993) Proc
Natl Acad Sci USA 90:5071-5075).

As a more sensitive measure of ectopic recombination,
PCR amplifications were performed on the same samples. The
amplification primers were expected to produce a 325 bp
product from the recombined target and a 1.4 kb fragment
from the unrecombined allele (Figure 1). The assay was
expected to measure the cumulative level of recombination,
for any P2Br alleles formed during transient expression of
Cre during development would be preserved and perhaps
amplified in descendant cells. Low levels of ectopic
recombination product were observed in some tissues of all
ProCre lines except for one. A Southern blot of PCR
amplification products of the P2Br allele utilized tissues
from a male heterozygous for the ProCre nucleic acid
construct and the P2Bc allele. DNA from 10 different
tissues was amplified using primers and conditions that
produced a 350 bp product from the recombined P2Br allele.
Each lane contains 10% of the reactions except for the
testis reactions, which were diluted 500 (T5), 250 (T2),
and 100 (T1) fold prior to loading, and a liver
reconstruction control (C), which was diluted 1:100 before
loading. The highest level of ectopic activity was
observed in cardiac ventricular muscle of mice; in these
samples the ectopic signal was more than 100 fold lower
than that observed in testis. Three strains showed much
lower levels of recombination in brain tissue, and
strain 75 additionally showed ectopic activity in spleen.
Despite the difficulty of quantifying PCR results, these
data clearly indicate that ectopic activity occurred at
very low levels in most tissues of most ProCre lines.
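
As a rough illustration of what "more than 100 fold lower than testis" implies, the sketch below combines that figure with the 10% to 22% testis signal range reported from the Southern blots. This is only a back-of-the-envelope bound: it assumes the fold difference seen by PCR can be applied to the Southern-derived testis values, which the text does not claim, and the function name is hypothetical.

```python
def ectopic_signal_upper_bound(testis_percent: float,
                               fold_lower: float = 100.0) -> float:
    """Upper bound, in percent of the wild-type signal, for an ectopic
    signal that is at least `fold_lower`-fold weaker than testis."""
    if fold_lower <= 0:
        raise ValueError("fold_lower must be positive")
    return testis_percent / fold_lower


# Testis P2Br signal range reported for the different ProCre strains.
for testis_percent in (10.0, 22.0):
    bound = ectopic_signal_upper_bound(testis_percent)
    print(f"testis {testis_percent:.0f}% -> ectopic below {bound:.2f}%")
```

On these assumptions the strongest ectopic signal would correspond to well under 1% of the wild-type allele signal, in line with the statement that ectopic activity was very low.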
Example 5
Isolation of Homologously Recombined ProCre ES Cell
Clones Using Targeting Vectors with a loxP-Flanked
Selectable Marker

Four male +/ProCre ES cell lines were established from
129/Sv strain ProCre transgenic mice. In preliminary
experiments, passage 5 cells from one of these lines (PC3)
were used to generate three male chimeras with between 50
and 95% coat color chimerism. In matings with C57BL/6
females, two of these male chimeras have sired a total of
11 pups, all bearing the Agouti coat color signifying
germline transmission of the ES cell genome, and 6 of 9
pups genotyped additionally contained the line 70 ProCre
nucleic acid construct. The frequency of germline
transmission has not yet been determined, nor has it been
determined whether competency for germline transmission
will persist in homologously recombined ProCre ES cells at
later passages.
To determine if homologously recombined ProCre ES cell
clones could be isolated using targeting vectors that
contained a loxP-flanked selectable marker, two
transfections were done using variants of a targeting
vector in which a loxP-flanked neomycin cassette was
inserted into an Nru I site in the hoxb-1 locus promoter
(Figure 2). For the Southern blot, BamHI-digested genomic
DNAs harvested from a 96-well plate from 10 doubly-selected
ES cell clones were hybridized with a probe (shown in Figure
2) which is external to the targeting construct. All
samples show the 7.5 kb band from the wild-type allele and
four clones additionally show the .6 kb band predicted to
result from homologous recombination. In these
transfections, 12 of 62 (19%) PC3- and 10 of 56 (18%)
PC5-derived clones that were ganciclovir and G418-resistant
(Mansour et al. (1988) Nature 336:348-352) were found to be
homologously recombined. In two parallel transfections of
CCE cells (Robertson et al. (1986) Nature 323:445-448) with
the same vectors, 32 of 93 (34%) and 15 of 132 (11%) clones
were homologously recombined. The total numbers of
G418-resistant clones recovered from ProCre ES cell
transfections were reduced relative to the parallel CCE
transfections. This may be attributable both to
Cre-mediated excision of the neomycin cassette and to the
fact that the transfections were done under electroporation
conditions optimized for CCE cells.
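
The targeting frequencies quoted above can be checked directly from the reported clone counts. The snippet below is a minimal sketch of that arithmetic; the dictionary labels are descriptive placeholders chosen for illustration, and only the counts come from the text.

```python
# (homologously recombined clones, doubly resistant clones screened),
# as reported in the text; the labels are illustrative only.
clone_counts = {
    "PC3 ProCre line": (12, 62),
    "second ProCre line": (10, 56),
    "CCE, first transfection": (32, 93),
    "CCE, second transfection": (15, 132),
}


def targeting_frequency(recombined: int, screened: int) -> float:
    """Percentage of screened clones that were homologously recombined."""
    if screened <= 0:
        raise ValueError("screened must be positive")
    return 100.0 * recombined / screened


for label, (recombined, screened) in clone_counts.items():
    freq = targeting_frequency(recombined, screened)
    print(f"{label}: {recombined}/{screened} = {freq:.0f}%")
```

Rounded to whole percentages, the counts reproduce the stated 19%, 18%, 34%, and 11%, so the ProCre lines fall within the range spanned by the two CCE control transfections.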

Because it was formally possible that the homologously
recombined clones contained inactive loxP sites, five
homologously recombined PC3 ES cell clones and the parental
PC3 cell line were either mock transfected or transiently
transfected with the pOG231 Cre expression vector. For the
transient transfection assay, DNA was harvested 48 hours
after transfection and used in PCR assays, with the primers
shown in Figure 2, to assess whether the loxP sites in the
recombinant clones could be recombined by Cre. In all cases
a clear recombination signal was observed in the
pOG231-transfected sample. The recombinant clones and
parental cell lines show the 204 bp amplification product
of the wild-type allele, and the recombinant clones
additionally show a 1600 bp product (1600) resulting from
amplification across the neomycin cassette and a
nonspecific 1100 bp amplification product (NS). The
pOG231-transfected recombinant clones show an additional
268 bp product signaling the Cre-mediated excision of the
neomycin cassette from the recombinant alleles of some
cells. Experiments were also done to assess the stability
of the loxP-flanked neo cassette in ProCre ES cells. Five
recombinant clones were grown in the presence of G418 for
two weeks, and then aliquots of each were grown either in
the presence or absence of G418 for a further 10 days. PCR
assays were performed to determine if Cre-recombined
alleles were present in any of these samples, and none was
observed in the mock transfected controls. These data
suggest that there is not enough Cre activity to
significantly influence either the ability to isolate
recombinant clones or the stability of the selectable
markers in those clones, and establish that the loxP sites
in these clones were functional.

To determine if there was any detectable Cre activity
in ProCre ES cells, aliquots of two lines (PC3 and PC5)
were transiently transfected with the targeting vector used
to create the P2Bc allele. DNA was recovered 48 hours
after transfection and used for PCR amplifications of the
P2Br plasmid molecules that would be generated by
extrachromosomal Cre-mediated recombination. Small amounts
of recombination product were seen in both ProCre ES cell
transfections, and none was observed in parallel samples of
CCE ES cells. This shows that the ProCre ES cell lines
express sufficient Cre to recombine some extrachromosomal
targets when the latter are present at high copy numbers.
Example 6
Plant DNA Constructs 

To define sequences in the LAT52 and LAT59 promoters
involved in expression in pollen, proximal promoter
constructs bearing a series of linker substitution
mutations were prepared and tested using the particle
bombardment system (Klein et al. (1987) Nature 327:70-73;
Twell et al. (1989b) Plant Physiol 91:1270-1274). These
experiments were performed by co-bombarding the test
plasmids (luciferase [LUC]-recombinase fusions) with
reference plasmids (β-glucuronidase [GUS] fusions). The
latter served as a control for bombardment variability and
allowed comparisons to be made between independent
bombardments.

The context of the -100 promoter in LAT52 and the -115
promoter in LAT59 was chosen because these promoters
appeared to be the minimal regions that still conferred
high levels (25% relative to the available full-length
promoter) of pollen-specific expression (Twell et al.
(1991) Genes Dev 5:496-507). These minimal promoters were
then fused to the Cre coding sequence operatively linked to
the luc gene (Ow et al. (1986) Science 234:856-858) coding
region, and the resulting plasmids served as a basis for
creating the nucleic acid constructs. The LAT52 linker
substitutions were performed in p52LUC, which contains the
entire LAT52 5' untranslated region (5' UTR). A series of
six 9- to 10-bp-long linker substitutions were made in
p52LUC, spanning the region -84 to -29 (52LS1 to 52LS6).
Example 7
Tissue Specificity in Plants 

The results obtained by transient expression in pollen
and in transgenic plants provided information on the effect
of the various constructs on expression in pollen but not
on their effect on tissue specificity. A tobacco cell
culture, TXD (maintained as described by Howard et al.
(1992) Cell 68:109-118), was, therefore, added as an
additional component of the transient assay system. The
TXD cell culture was initiated from tobacco mesophyll cells
and therefore represents somatic tissue, as opposed to the
gametophytic tissue represented by pollen. Cells in
culture were chosen, rather than intact tissue, as the
somatic tissue source because such cells superficially
resemble pollen in that they can be spread out as a
monolayer on a plate before bombardment.

In this experiment, translational fusions between the
luc coding region and either the CaMV 35S promoter or the
LAT52 promoter were compared. The CaMV 35S promoter drove
strong expression in cell culture but negligible expression
in pollen, whereas the LAT52 promoter showed the opposite
pattern of strong activity in pollen and negligible
activity in cell culture. Thus, the transient assay system
mimics the expression pattern observed for these promoters
in transgenic plants (Twell et al. (1991) Genes Dev
5:496-507). This differential expression provided us with a
tool with which to address tissue specificity.

Example 8 

Plant Transformation and Analysis of Transgenic Plants

Constructs cloned into pBin19 were introduced into
tomato (Lycopersicon esculentum cv VF36) by Agrobacterium
tumefaciens LBA4404 as previously described (McCormick
(1991b) Transformation of tomato with Agrobacterium
tumefaciens, In Plant Tissue Culture Manual, K. Lindsey, Ed
B6:1-9). At least 20 independent transformants were
obtained for each construct.

For β-glucuronidase (GUS) assays, 5 to 20 of
pollen, pooled from several flowers of the same plant, was
ground directly in Eppendorf tubes in 50 to 100 µL of GUS
extraction buffer (Jefferson et al. (1987) EMBO 6:3901-
3907) using a Teflon-tipped homogenizer driven by a drill.
Expression in pollen was measured by fluorometrically
assaying GUS activity in supernatants of pollen extracts
using 2 mM 4-methylumbelliferyl β-D-glucuronide (Sigma) as
substrate (Jefferson et al. (1987) EMBO 6:3901-3907). GUS
activity was corrected for variation in total protein
content using a bicinchoninic acid protein assay kit
(Pierce, Rockford, IL).

Expression in leaves, flowers, stems, roots, and seed
was tested histochemically by staining with 5-bromo-4-
chloro-3-indolyl β-D-glucuronide (Molecular Probes; Eugene,
OR) as described previously (Jefferson et al. (1987) EMBO
6:3901-3907). Expression in leaves was also analyzed
fluorometrically as given previously.

Example 9
Transient Transformation of Tobacco Pollen
and Cell Culture

Pollen, spread out as a monolayer, was bombarded
essentially as previously described (Twell et al. (1991)
Genes Dev 5:496-507), except that gold was substituted for
tungsten and only 1 µg of test plasmid was used per plate.
TXD cell culture (maintained as described by Howard et al.
(1992) Cell 68:109-118) was spread out similarly as a
monolayer (1 mL of a 50-mL stationary culture per plate)
and bombarded as previously described. Between six and 12
independent bombardments were performed for each construct.
In each experiment, the test plasmid was co-bombarded with
a reference plasmid: pB1223 (Clontech, Palo Alto, CA) was
used for assays of all constructs in tobacco cell culture;
pLAT59-12 (Twell et al. (1990) Development 109:705-713) for
assays of LAT52 and LAT56 constructs in tobacco pollen;
pLAT56-12 (Twell et al. (1990) Development 109:705-713) for
assays of LAT59 constructs in tobacco pollen. Processing
of the tissue after 15 to 17 hr and analysis of GUS and
LUC activity were as described previously (Twell et al.
(1991) Genes Dev 5:496-507). Transient expression was
reported as "relative LUC activity," which represents the
ratio between the test (LUC) and the reference (GUS)
plasmids.
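
The normalization described here, dividing the test (LUC) reading by the co-bombarded reference (GUS) reading for each plate, can be sketched as follows. This is an illustration rather than the protocol actually used: the readings are invented placeholder numbers in arbitrary units, and the function and variable names are hypothetical.

```python
from statistics import mean, stdev


def relative_luc_activity(luc: float, gus: float) -> float:
    """Ratio of test (LUC) to reference (GUS) activity for one bombardment."""
    if gus <= 0:
        raise ValueError("reference (GUS) activity must be positive")
    return luc / gus


# Hypothetical (LUC, GUS) readings from six independent bombardments of
# one construct; the values are placeholders, not data from the text.
bombardments = [
    (1200.0, 300.0),
    (900.0, 250.0),
    (1500.0, 400.0),
    (1100.0, 260.0),
    (800.0, 210.0),
    (1300.0, 330.0),
]

ratios = [relative_luc_activity(luc, gus) for luc, gus in bombardments]
print(f"mean relative LUC activity: {mean(ratios):.2f}")
print(f"standard deviation: {stdev(ratios):.2f}")
```

Dividing within each plate before averaging is what lets the reference plasmid absorb plate-to-plate variation in bombardment efficiency, so that constructs tested in independent bombardments can be compared.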

While the invention has been described in detail with
reference to certain preferred embodiments thereof, it will
be understood that modifications and variations are within
the spirit and scope of that which is described and
claimed.
That which is claimed is:

1. A nucleic acid construct comprising a germline-
specific promoter operatively associated with a recombinase
coding sequence.

2. A nucleic acid construct according to claim 1
wherein said germline-specific promoter is the protamine 1
gene promoter, the protamine 2 gene promoter, the
spermatid-specific promoter from the c-kit gene, the sperm-
specific promoter from angiotensin-converting enzyme,
oocyte specific promoter from the ZP1 gene, oocyte specific
promoter from the ZP2 gene, or oocyte specific promoter
from the ZP3 gene.

3. A nucleic acid construct according to claim 1
wherein said germline-specific promoter is the LAT52 gene
promoter from tomato, the LAT56 gene promoter from tomato,
the LAT59 gene promoter from tomato, the pollen-specific
promoter of the Brassica S locus glycoprotein gene, or the
pollen-specific promoter of the NTP303 gene.

4. A nucleic acid construct according to claim 1
wherein said recombinase coding sequence encodes Cre
recombinase.

5. A nucleic acid construct according to claim 4
wherein said construct is ProCre, comprising the protamine
1 gene promoter operatively associated with Cre
recombinase.

6. A nucleic acid construct according to claim 1
wherein said recombinase coding sequence encodes FLP
recombinase.

7. A nucleic acid construct according to claim 6 
wherein said construct is ProFLP, comprising the protamine 
1 gene promoter operatively associated with FLP 
recombinase. 

8. A nucleic acid construct according to claim 1
wherein said recombinase coding sequence encodes the R gene
product of Zygosaccharomyces.

9. A nucleic acid construct according to claim 8
wherein said construct is ProR, comprising the protamine 1
gene promoter operatively associated with the R gene
product of Zygosaccharomyces.

10. A nucleic acid construct comprising a conditional
promoter operatively associated with a recombinase coding
sequence.

11. A nucleic acid construct comprising a tissue-
specific promoter operatively associated with a recombinase
coding sequence.

12. Embryonic stem cells containing a nucleic acid 
construct according to claim 1. 

13. Embryonic stem cells according to claim 12
wherein the genome thereof comprises a transcriptionally
active selectable marker flanked by two recombination
target sites.

14. Embryonic stem cells according to claim 13
wherein the recombinase encoded by the recombinase coding
sequence operatively associated with a germline-specific
promoter is selective for the recombination target sites
flanking said selectable marker.

15. Embryonic stem cells according to claim 13
further comprising one or more of:

a nucleic acid fragment flanked by two recombination
target sites, wherein said recombination target sites are
different than the recombination target sites which flank
said selectable marker,

a nucleic acid construct comprising a conditional
promoter operatively associated with a recombinase coding
sequence, or

a nucleic acid construct comprising a tissue-specific
promoter operatively associated with a recombinase coding
sequence.

16. Embryonic stem cells containing a nucleic acid
construct according to claim 2.

17. Embryonic stem cells containing a nucleic acid 
construct according to claim 3. 

18. Embryonic stem cells containing a nucleic acid
construct according to claim 4.

19. Embryonic stem cells containing a nucleic acid
construct according to claim 5.

20. Embryonic stem cells containing a nucleic acid
construct according to claim 6.

21. Embryonic stem cells containing a nucleic acid
construct according to claim 7.

22. Embryonic stem cells containing a nucleic acid
construct according to claim 8.

23. Embryonic stem cells containing a nucleic acid 
construct according to claim 9. 

24. Embryonic stem cells containing a nucleic acid
construct according to claim 10.

25. Embryonic stem cells according to claim 24
wherein the genome thereof comprises a transcriptionally
active selectable marker flanked by two recombination
target sites.

26. Embryonic stem cells containing a nucleic acid 
construct according to claim 11. 

27. Embryonic stem cells according to claim 26
wherein the genome thereof comprises a transcriptionally
active selectable marker flanked by two recombination
target sites.

28. A method for excision of the transcriptionally
active selectable marker from the embryonic stem cells of
claim 13, said method comprising:

passaging the genome derived from said embryonic stem
cells through gametogenesis.

29. A method according to claim 28 wherein said
genome is passaged through spermatogenesis.

30. A method according to claim 28 wherein said 
genome is passaged through oogenesis. 

31. A method according to claim 28 wherein said
embryonic stem cells further comprise one or more of:

a nucleic acid fragment flanked by two recombination
target sites, wherein said recombination target sites are
different than the recombination target sites which flank
said selectable marker,

a nucleic acid construct comprising a conditional
promoter operatively associated with a recombinase coding
sequence, or

a nucleic acid construct comprising a tissue-specific
promoter operatively associated with a recombinase coding
sequence.

32. A method for the production of recombinant
alleles, said method comprising:

introducing a nucleic acid fragment flanked by at
least two recombination target sites into embryonic stem
cells according to claim 10, and

passaging the genome derived from said embryonic stem
cells through gametogenesis.

33. A method according to claim 32 wherein said
nucleic acid fragment comprises an essential portion of a
gene of interest.

34. A method according to claim 32 wherein said 
nucleic acid fragment is introduced by homologous 
recombination, random insertion, retroviral insertion, or 
site specific-mediated recombination. 

■35. A method for the production of recombinant 
^alleles, said method comprising; 

.introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
cells according to claim '13 , and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis. 

36. A method according to claim 35 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue-specific promoter operatively 
associated with a recombinase coding sequence. 

37. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in response to inducing conditions. 

38. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in a tissue selective manner. 

39. A method according to claim 35 wherein the 
recombination target sites flanking said nucleic acid 
fragment are recognized by a recombinase which is expressed 
under the control of a conditional promoter or a tissue 
specific promoter. 

40. A method for the production of recombinant 
alleles, said method comprising: 

introducing at least one recombinase responsive construct 
into embryonic stem cells according to claim 10, 

wherein said construct(s) comprise(s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 

wherein said nucleic acid fragment is flanked by 
a second pair of recombination target sites, 

passaging the genome derived from said embryonic stem cells 
through gametogenesis. 

41. A method according to claim 40 wherein said first 
pair of recombination target sites is recognized by a 
recombinase which is expressed under the control of a 
germline-specific promoter and said second pair of 
recombination target sites is recognized by a recombinase 
which is expressed under the control of a conditional 
promoter or a tissue specific promoter. 

42. A method according to claim 40 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue-specific promoter operatively 
associated with a recombinase coding sequence. 

43. A method for the conditional assembly of 
functional gene(s) for expression in eukaryotic cells by 
recombination of individual inactive gene segments from one 
or more gene(s) of interest, 

wherein each of said segments contains at least one 
recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 

said method comprising: 

introducing said individual inactive gene 
segments into an embryonic stem cell according to 
claim 10, thereby providing a DNA which encodes a 
functional gene of interest, the expression product of 
which is biologically active, upon passage of the 
genome derived from said stem cells through 
gametogenesis. 

44. A method for the generation of recombinant 
livestock, said method comprising: 

combining embryonic stem cells that include a nucleic 
acid construct according to claim 1 with host 
pluripotential ES cells derived from early preimplantation 
embryos, and 

introducing these combined embryos into a host female, and 

allowing the derived embryos to come to term. 

45. A method for the generation of recombinant 
plants, said method comprising transforming plant zygotes 
with nucleic acid constructs according to claim 1 and 
allowing the zygote to develop. 

SEQUENCE LISTING 

<110> O'Gorman, Steve 
Wahl, Geoffrey 

<120> Site-Specific Germline Recombination in 
Eukaryotes and Constructs Useful Therefor 

<130> Salk2190 

<150> 08/919,501 
<151> 1997-08-28 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 652 
<212> DNA 

<213> Mus musculus 
<400> 1 

gtctagtaat gtccaacacc tccctcagtc caaacactgc tctgcatcca tgtggctccc 60 

atttatacct gaagcacttg atggggcctc aatgttttac tagagcccac ccccctgcaa 120 

ctctgagacc ctctggattt gtctgtcagt gcctcactgg ggcgttggat aatttcttaa 180 

aaggtcaagt tccctcagca gcattctctg agcagtctga agatgtgtgc tttcacagtt 240 
acaaatccat gtggctgttt cacccacctg cctggccttg ggttatctat caggacctag 300 
cctagaagca ggtgtgtggc acttaacacc taagctgagt gactaactga acactcaagt 360 
ggatgccatc tttgtcactt cttgactgtg acacaagcaa ctcctgatgc caaagccctg 420 

cccacccctc tcatgcccat atttggacat ggtacaggtc ctcactggcc atggtctgtg 480 
aggtcctggt cctctttgac ttcataattc ctaggggcca ctagtatcta taagaggaag 540 

agggtgctgg ctcccaggcc acagcccaca aaattccacc tgctcacagg ttggctggct 600 
cgacccaggt ggtgtcccct gctctgagcc agctcccggc caagccagca cc 652 

<210> 2 
<211> 29 
<212> DNA 

<213> Artificial Sequence 

<400> 2 

gtctagtaat gtccaacacc tccctcagt 29 

<210> 3 

<211> 31 

<212> DNA 
<213> Artificial Sequence 

<400> 3 

ctctgagcca gctcccggcc aagccagcac c 31 

<210> 4 

<211> 1022 
<212> DNA 

<213> Artificial Sequence 
<400> 4 

atggagcaaa agctgatttc tgaggaggat ctgggaggac ccaagaagaa gaggaaggtg 60 
tccaatttac tgaccgtaca ccaaaatttg cctgcattac cggtcgatgc aacgagtgat 120 
gaggttcgca agaacctgat ggacatgttc agggatcgcc aggcgttttc tgagcatacc 180 
tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtg aataaccgga 240 

aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg 300 

gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt catcgtcggt 360 

ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 420 

aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 480 

tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata cgtaatctgg 540 

catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 600 
ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 660 

cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 720 

agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 780 

tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact cgcgccctgg 840 

aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 900 

gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 960 

ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta aatattgtca 1020 

tg 1022 

<210> 5 

<211> 2293 

<212> DNA 

<213> Artificial Sequence 
<400> 5 

gtctagtaat gtccaacacc tccctcagtc caaacactgc tctgcatcca tgtggctccc 60 

atttatacct gaagcacttg atggggcctc aatgttttac tagagcccac ccccctgcaa 120 

ctctgagacc ctctggattt gtctgtcagt gcctcactgg ggcgttggat aatttcttaa 180 

aaggtcaagt tccctcagca gcattctctg agcagtctga agatgtgtgc tttcacagtt 240 

acaaatccat gtggctgttt cacccacctg cctggccttg ggttatctat caggacctag 300 

cctagaagca ggtgtgtggc acttaacacc taagctgagt gactaactga acactcaagt 360 
ggatgccatc tttgtcactt cttgactgtg acacaagcaa ctcctgatgc caaagccctg 420 
cccacccctc tcatgcccat atttggacat ggtacaggtc ctcactggcc atggtctgtg 480 
aggtcctggt cctctttgac ttcataattc ctaggggcca ctagtatcta taagaggaag 540 

agggtgctgg ctcccaggcc acagcccaca aaattccacc tgctcacagg ttggctggct 600 

cgacccaggt ggtgtcccct gctctgagcc agctcccggc caagccagca cccgggacca 660 

tggagcaaaa gctgatttct gaggaggatc tgggaggacc caagaagaag aggaaggtgt 720 

ccaatttact gaccgtacac caaaatttgc ctgcattacc ggtcgatgca acgagtgatg 780 

aggttcgcaa gaacctgatg gacatgttca gggatcgcca ggcgttttct gagcatacct 840 

ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg gtgcaagttg aataaccgga 900 

aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg 960 

gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt catcgtcggt 1020 

ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 1080 

aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 1140 

tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata cgtaatctgg 1200 

catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 1260 

ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 1320 

cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 1380 

agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 1440 

tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact cgcgccctgg 1500 

aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 1560 

gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 1620 

ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta aatattgtca 1680 

tgaactatat ccgtaacctg gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg 1740 

gcgattagcc attaacgcgt aaatgattgc tataattatt tgatatttat ggtgacatat 1800 

gagaaaggat ttcaacatcg acggaaaata tgtagtgctg tctgtaagca ctaatattca 1860 

gtcgccagcc gacattgtca ctgtaaagct gagcgataga atgcctgata ttgactcaat 1920 

atccggtgcg tttcctgtca aaagtatgcg tagtgctgaa catttcgcga tgaatcccac 1980 

cgaggaagaa gcacggcgcg gttttgctaa agtgatgtct gagtttggcg aactcttggg 2040 

taaggttgga attgtcgagg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 2100 

cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg 2160 

gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct 2220 

gaggggatcg gcaataaaaa gacagaataa aacgcacggg tgttgggtcg tttgttcgga 2280 

tcgatccgtc gac 2293 

<210> 6 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<400> 6 

cccgggatca attcaccatg ggaataactt cgtatagcat acattatacg aagttatgga 
tccgccgcta tcaggacata gcgttg 

<210> 7 

<211> 4172 
<212> DNA 

<213> Artificial Sequence 

<400> 7 

gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 
atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 
agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 
ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 
gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 
gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 
tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 
acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 
aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 
cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 
gcgttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 
cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 
tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 
tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 
ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 
tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 
gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 
ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 
tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 
agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 
aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 
cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 
agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 
tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 
gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 
gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 
ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 
gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 
ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 
ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 
acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 
gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 
cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 
gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga 
gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt 
gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 
agctcgaaat taaccctcac taaagggaac aaaagctggg tacgaattca gatctcccgg 
gatcaattca ccatgggaat aacttcgtat agcatacatt atacgaagtt atggatccgg 
tcgagcagtg tggttttgca agaggaagca aaaagcctct ccacccaggc ctggaatgtt 
tccacccaat gtcgagcagt gtggttttgc aagaggaagc aaaaagcctc tccacccagg 
cctggaatgt ttccacccaa tgtcgagcaa accccgccca gcgtcttgtc attggcgaat 
tcgaacacgc agatgcagtc ggggcggcgc ggtcccaggt ccacttcgca tattaaggtg 
acgcgtgtgg cctcgaacac cgagcgaccc tgcagccaat atgggatcgg ccattgaaca 
agatggattg cacgcaggtt ctccggccgc ttgggtggag aggctattcg gctatgactg 
ggcacaacag acaatcggct gctctgatgc cgccgtgttc cggctgtcag cgcaggggcg 
cccggttctt tttgtcaaga ccgacctgtc cggtgccctg aatgaactgc aggacgaggc 
agcgcggcta tcgtggctgg ccacgacggg cgttccttgc gcagctgtgc tcgacgttgt 
cactgaagcg ggaagggact ggctgctatt gggcgaagtg ccggggcagg atctcctgtc 2880 

atctcacctt gctcctgccg agaaagtatc catcatggct gatgcaatgc ggcggctgca 2940 

tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca tcgagcgagc 3000 

acgtactcgg atggaagccg gtcttgtcga tcaggatgat ctggacgaag agcatcaggg 3060 

gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc atgcccgacg gcgaggatct 3120 

cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc 3180 

tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc 3240 

tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta 3300 

cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg acgagttctt 3360 

ctgaggggat cggcaataaa aagacagaat aaaacgcacg ggtgttgggt cgtttgttcg 3420 

gatagggatc aattcaccat gggaataact tcgtatagca tacattatac gaagttatgg 3480 

atccactagt tctagagcgg ccgccaccgc ggtggagctc caattcgccc tatagtgagt 3540 

cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3600 

cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3660 

cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 3720 

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 3780 

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 3840 

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 3900 

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 3960 

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4020 

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4080 

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 4140 

ttaacaaaat attaacgctt acaatttagg tg 4172 

<210> 8 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<400> 8 